Docket Number: U PN-41 05 (M2351 ) 
Title: Compositions and Methods of Using Capsid 
Protein from Flaviviruses and Pestiviruses 
Inventors: David B. Weiner and Joo-Sung Yang 
Sheet 10 of 30 



Sequence Rang*: 1 to 5364 



>SglII 

1 

10 | 20 30 40 50 60 70 80 90 100 

GACGGATCGGG3U»TCTCCO^Tra 

CTGCCTAGCCCTCT*CakGGGCTAGGGGATACCAGCTGAGAGTCATGTO 

>M£eI 

i 

110 120 130 140 150 160| 170 180 190 200 

GGAGGTCGCTGAGTASTGCCCGAGCAAAATTTAAGCTACAftCftAGGCAAiG^^ 
CCTCCAGCGACTCATCACGCGCTCGTTTTAAATTCG^ 

>Sp«I 
I 

>CHV_prcoaoter ( 
i i 
210 220 230 240 250 260 270 280 290 300 

CTGCTTCGCGATGTACGGGCCAGATATACGCGTTGACAT^ 
GACGAAQCQCTACATGCCCGGTCTATATCKGCAACTCT 

310 320 330 340 350 360 370 380 390 400 

TGGAGTTCCGCGTTACATAACTTACGG^ 

>Nd«I 

i 

410 420 430 440 450 460 470 480 ] 490 500 

AACGCCAATAGGGACTTTCCATTGACGTCA* 
TTGCGGTTATCCCTCAAAGGTAACTGCAGTTAC^^ 

>SnaBI 
I 

510 520 530 540 550 560 570 580 590 600 

CCTATTGACGTCAATGACGOTAMTGGCCCCXX^ 
GGATAACTGCAGTTACTGCCATTTACC<3^^ 

610 620 630 640 650 660 670 680 690 700 

TCGCTATTACCAT««X»TGCGGTTTra 
AGCGATAA7GCTACCACTACGCCAAAACCGTCA3CTAGTT 

710 720 730 740 750 760 770 780 790 800 

TGGGAGTTTGrrTrGGCACCAAAATCAACG^^ 
ACCCTCAAACAAAACCGTGX3TTTTAGTTGCCCTCA 




>T7_pr«noter 

I 

830 840 850 860 { 870 880 890 900 

TAACTAGAGAACCCACTOCTTACI^^ 
CAGATATATTCGTCTCGAGAGACCGATTGATCT^ 

>HindIII >PflMI 

! 

>Kozak_sequ«nca 

1 I 
>Kindtt 1.1 inker, t Split] 

fl t 

fi 910 920| 930 940 950 960 970 980 990 1000 

TAAGCTTOCCGCCACCATGGATTCGACTTGGATCTTATTTTTAGT^ 
ATTCGAACGCWSOTGGTACCTAACCTGAACCrAGAATAAAA^ 

KDWTWILPLVAAATRVHS> 
SIGEY > 

SKXPGGPGKS> 
WNVCHU* > 

910 920 930 903 TO 1335 OP A. PCV/HB. SIGEY -WNVCHU_0 980 990 1000> 

10 20 30 4_5 TO 437 OF SIGEY -WNVCHU* 70 80 90 100 > 

10 1 TO 54 OP SIGEY 40 50 > 

>Apal 

I 

>Bspl20I 

i ! 

1010 1020 1030 1040 1050 1060 1070 1080 1090 i U00 

CGCGCCGTGAAG&TGCTGAAGCGCGGGATQCCCCGCGTGCTGAGCC^ 
GCGCGGCACTTGTACGACTTCGCGCCGTACGGGCK 
RAVNMLKR GMPRVLS h IGLKRAMLSt, I D G K G P I> 

WNVCHU* > 

_1010 1020 1030.903 TO 1335 OF A. PCV/HB. SIGEY-WNVCHU_70 1080 1090 1100> 



110 120 130 1_5 TO 437 OF SIGEY-WNVCHU* 170 180 190 200 — > 

>SfiI 
I 

1110 1120 1130 1140 1150 1160 1170 1180 1190 1200 

GCTTCGTGCTGGOXTGCTGSCCT^^ 
CGAAGCACGACCGGGACGACCGGAAGAAG 

RPVLALLAFFKFTAIAPTRAVLDRWRGVHKOTAH> 
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4510 4520 4S30 4540 4550 4560 4570 4580 4590 4600 

GTGCTACAGAGTTl'inS3AAGTGGTGGCCTAACT^ 

46i ° *«0 4630 4640 4650 4660 4670 4680 4690 4700 

AGTT<OTAGCrCTT6fcTCCGGCAAACAAACC^ 
TCAA<X^TCGAGIACT*GGCCGTrTGTTTG<?r^ 

471 0 47 20 4730 4740 4750 4760 4770 4780 4790 4800 

^TCCTTT(»TCTTTTCTACGGGGTCTGACGCT^ 
CTAGGAAACTJUSAAAAGATGCCCCAGACTGCGACTCACCTTGCT 

I 810 "SO 4830 4840 4850 4860 4870 4880 4890 4900 

TCCTTTTAJ^TTM^TGAAGTTTTAAATCAATCT 
AGGAAAATTTAATTTTTACTTCAAAATTOAGTTAGATTTC 

>AhdI 

4910 4920 4930 4940 | 4950 4960 4970 4980 4990 5000 

AGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGAC^^ 
TCGCTAGACAGATAAAGCAAGTAGGTATCAACGG^ 

5010 5020 5030 5040 5050 5060 5070 5080 5090 5100 

ATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAA 

TATGGCGCTCTGGGTGCGAGTGGCCGAGGTCTAAATAGTCGTTATTTGGTCGGTCOOCCTT^ 

5110 5120 5130 5140 5150 5160 5170 5180 5190 5200 

C2£TCCAGTCTArcAATTGTTGC^ 

GOTAGGTCAGATAATTAACAACGGCCCTTCGATCTC^Tra 

acgctcgtcgtttgctatggcttcatt^ 5280 5290 5300 

TQCGAGCA(KyVAACCATACCGAAGTAAirCCGAGGCCAA^ 
>Pvut 

I 

5310} 5320 5330 5340 5350 5360 5370 5380 5390 S400 

CCAG<*AGGCTAGCAACAGTCTTCATTGAACCG(K 

>scal 

I 

5410 5420) 5430 5440 5450 5460 5470 5480 5490 5500 

GCTTTTCTGTGACTGGTGACTAC^^ 
CGAAAAGACACTCACX3UrraTGA^ 

5510 __ 55 20 55 30 5540 5550 5560 5570 5580 5590 5600 

GCCAC^TAGCAGAAC^ 
CGGTGTATCGTCTTGAAATTTTCACG^ 

S610 S62( J 5630 5640 5650 5660 5670 5680 5690 5700 

CCCACTCGTGCACCCAACTGATCTTCAGCATCTT^^ 
<3GGTGAGCACGTGGGTTGACTAGAA<^^ 

>SflpI 

5710 5? 20 5730 5740 | 5750 5760 5770 5780 5790 5800 

CCCGCTGTXXXrrTTACAACTTATGAOTATGAGAAQGAAA^ 

5810 5820 5830 5840 5850 5860 

TATTT AGAAAAATAAACAAA TAGGGCSTTCCGCGCACATTTCCCCGAAAAGTGCCaCCTGACGTC 
ATAAATCTTTTTATTTGTTTATCCCCAAGGCGCGT^ 
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_1130_903 TO 1335 OF A . PCV/HB . SIGSY-WNVCHU_7 0_ 



_210 220 230 2J> TO 437 OP SIGEY-WNVCHU* 270 280 290 300„> 



1210 1220 1230 1240 1250 1260 1270 1280 1290 1300 

GAAGCACCTGCTGAGCTTCAAGAAGGAGCTGGGCAC^ 
CTTCGTQGAeG»CT(»AJ^TlCin ^^ 

KHLI#SPKKSLGTLTSAIMRRSSKQKKRGGKTGI> 

WNVCHU* > 

_1210 1220 1230.903 TO 133S OP A. PCV/HB. SIGEY~WNVCHU_70 1280 1290 1300> 



310 320 330 3_5 TO 437 OP SIGEY-WNVCHU* 370 380 390 400 — > 

>BstEII 
I 

>PaeR7i | 
I I 

>NotI >XJlOl | >BStBI 

! I i 1 
>Not I_iat roduc t i on_ I Spl i t J >VS_epi tope 

Mi! II 

1310 1320 1330 II 1340) 1 1350 j 1360 1 1370 1380 1390 1400 

GCCGTGATGATTGGCCTGATCGCO^GTGGGCG^ 
CGGCACTACTAACCGGACTAGCGGTCGCACCCGCGCCGGCGA 
AVMIGLIASVGA> 



903 TO 1335 OF A. PCV/HB. SIGE > 

5 TO 437 OF SIGEY-WNVCHU* > 

>AgeI >PmeI 

I 1 

>Polyhistidine_tag | >BGH - polyadenylation_signal 

II I I 

11410 ) 1420 1430 )1440 1450 | 1460 1470 1480 1490 1500 

CGCGTACCXjGTCATCATCACCATCACCATTGAG^ 
GCGCATGGCCAGTAGTAGTGGTAGTGGTAACTCAAATTTGGGCGACT 

1510 1520 1530 1540 1550 1560 1570 1580 1590 1600 

CGTGCCTTOnrrGACCCTGGAAGG 

GCACGGAAGCAACTGGGACCTTCCACGGTGAGGGTGAGAGGAAAGGATT^^ 

>BbaI 

I 

1610 1620 1630 1640 1650 1660 1670 1680 1690 1700 

GGGGGTCGGGTGGGGCAGGACAGCAAGGGGGAGGATTGG 
CCCCCACCCCACCCCGTCCTGTCGTTCCrc^^ 

1710 1720 1730 1740 1750 1760 1770 1780 1790 1800 

CCAGCTGGGGCTCTAGGGGGTATCCCCACGCGCCCTGTAGCGGCGCATTAAGCG 
GGTCGATCOTGAGATCCCCCATAGGGGTGCGCGGGACATCGCCG 

1810 1 820 1830 1840 1850 1860 1870 1880 1890 1900 

CGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCG 
GCGGGATCGCGGGCGAGGAAAGCGAAAGAAGGGAAGGAAAGAGCG 

>DraIII 

I 

1910 1920 1930 1940 1950 I960} 1970 1980 1990 2000 

cgatttagtgctttacggcacctcgaccccaaaaaac 
gctaaatcacgaaatgccgtggagctggggttttttg 

2010 2020 2030 2040 2050 2060 2070 2080 2090 2100 

CGTTGGAGTCCAe G^Vm ^AATAGTGGA ^ 
GCAACCTCAGGTGCAAGAAATTATCACCTGAGAACAAGGT 

2110 2120 2130 2140 2150 2160 2170 2180 2190 2200 

GATTTCGGCCTATTGCTTAAAAAATGAGCTGATTT^ 
CTAAAGCCQGATMCCAWTTTTTACTCGjyCTAAATTGTTTTTAAATTGCG 

2210 2220 2230 2240 2250 2260 2270 22B0 2290 2300 

ATACGTTT 

2310 2320 2330 2340 2350 2360 2370 2380 2390 2400 

GCATGCATCTCAATTAOTCAGC»ACCATASTC^^ 
CCTACCTAGAGTTAATCAGTCGTTGGTATCAGGGOSGGG^ 

>avtii 

I 

>StuI 

2410 2420 2430 2440 2450 2460 2470 2480 ! I 2490 2500 

ACTAAllUU'lTinATTOATGCAGACGCCQAGGCCg^ 
TGATTAAAAAAAATAAATACGTCTCCGGCTCCGGCGGAGA^ 

>S»ai 

I 

>a»i >BaaBI 

! 1 ' 

12510 2520 2530 2540 2550 | 2560 2570 2580 2590 2600 

AAGCTCCCGGGAGCTTGTATATCCATTTTC 
TTCGAGGGCCCTCGAACATATAGgrAAAAGCCTAC 



